Unsupervised News Video Segmentation by Combined Audio-Video Analysis

Authors

  • Massimo De Santo
  • Gennaro Percannella
  • Carlo Sansone
  • Mario Vento
Abstract

Segmenting news video into stories is one of the key issues in managing news-based digital libraries efficiently. In this paper we present a novel unsupervised algorithm that combines audio and video information to automatically partition news videos into stories. The proposed algorithm is based on the detection of anchor shots within the video. In particular, a set of audio/video templates of anchorperson shots is first extracted in an unsupervised way; shots are then classified by comparing them to the templates using both video and audio similarity. Finally, a story is obtained by linking each anchor shot with all successive shots until another anchor shot, or the end of the news video, occurs. Audio similarity is evaluated by means of a new index and helps to achieve better anchor shot detection performance than a purely video-based approach. The method has been tested on a large database and compared with other state-of-the-art algorithms, demonstrating its effectiveness relative to them.
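
The final linking step described in the abstract can be illustrated with a minimal sketch. It assumes that each shot has already been labelled as anchor or non-anchor by some earlier stage; the function name `group_shots_into_stories` and the boolean-list input are hypothetical and are not taken from the paper. Leaving shots that precede the first anchor shot unassigned is also an assumption of this sketch.

```python
from typing import List


def group_shots_into_stories(is_anchor: List[bool]) -> List[List[int]]:
    """Group shot indices into stories.

    Each story starts at an anchor shot and collects every following shot
    until the next anchor shot or the end of the video.
    """
    stories: List[List[int]] = []
    for index, anchor in enumerate(is_anchor):
        if anchor:
            # An anchor shot opens a new story.
            stories.append([index])
        elif stories:
            # A non-anchor shot is linked to the most recent anchor shot.
            stories[-1].append(index)
        # Assumption: shots before the first anchor shot are left unassigned.
    return stories


if __name__ == "__main__":
    # Hypothetical labels for six consecutive shots.
    labels = [False, True, False, False, True, False]
    print(group_shots_into_stories(labels))
```

With these hypothetical labels, shots 1 to 3 form one story and shots 4 to 5 form the next, while shot 0 is left out because no anchor shot precedes it.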

Similar resources

Combining Audio-Based and Video-Based Shot Classification Systems for News Videos Segmentation

In this paper we propose an innovative combination strategy for a system using video and audio stream of a news video to automatically segment it into stories. In our approach, the segmentation is performed in two steps: first, shots are classified by combining three different anchor shot detection algorithms using video information only. Then, the shot classification is improved by using a nov...

Unsupervised video-shot segmentation and model-free anchorperson detection for news video story parsing

News story parsing is an important and challenging task in a news video library system. In this paper, we address two important components in a news video story parsing system: shot boundary detection and anchorperson detection. First, an unsupervised fuzzy c-means algorithm is used to detect video-shot boundaries in order to segment a news video into video shots. Then, a graph-theoretical clust...

Unsupervised and Model-Free News Video Segmentation

Based on a simple temporal structural model of news program, this paper presents a practical solution to automatic news story segmentation by integrating syntactic and semantic methods. First, a syntactic segmentation method is used to detect the shot boundaries in order to partition video frames into video shots. Then a semantic segmentation method based on the graph-theoretical cluster analys...

Audio-Video Based Segmentation and Classification using AANN

This paper presents a method to classify audio-video data into one of seven classes: advertisement, cartoon, news, movie, and songs. Automatic audio-video classification is very useful for audio-video indexing and content-based audio-video retrieval. Mel frequency cepstral coefficients are used to characterize the audio data. The color histogram features extracted from the images in the video clips...

Audio-visual segmentation for content-based retrieval

This paper reports recent work at ORL on segmentation of digital audio/video recordings. Firstly, we describe an audio segmentation algorithm that partitions a soundtrack into manageably sized segments for speech recognition. Secondly, we present an algorithm for detecting camera shot-break locations in the video. The output of these two algorithms is combined to produce a semantically meaningf...

Publication date: 2006